Estimating confidence using word lattices
نویسندگان
چکیده
For many practical applications of speech recognition systems, it is desirable to have an estimate of con dence for each hypothesized word, i.e. to have an estimate which words of the speech recognizer's output are likely to be correct and which are not reliable. Many of today's speech recognition systems use word lattices as a compact representation of a set of alternative hypothesis. We exploit the use of such word lattices as information sources for the measure-of-con dence tagger JANKA [1]. In experiments on spontaneous human-to-human speech data the use of word lattice related information signi cantly improves the tagging accuracy.
منابع مشابه
K Eyword S Potting on W Ord L Attices
In spite of its numerous potential applications, Automatic Speech Recognition (ASR) remains a difficult (and mainly unsolved) problem. In addition to the intrinsic difficulty of the task, users tend to go beyond the pre-defined lexicon words, and the important keywords necessary to understand voice requests are often lost in extra words. In this context, it is often interesting to develop Keywo...
متن کاملLarge vocabulary decoding and confidence estimation using word posterior probabilities
This paper investigates the estimation of word posterior probabilities based on word lattices and presents applications of these posteriors in a large vocabulary speech recognition system. A novel approach to integrating these word posterior probability distributions into a conventional Viterbi decoder is presented. The problem of the robust estimation of confidence scores from word posteriors ...
متن کاملA general algorithm for word graph matrix decomposition
In automatic speech recognition, word graphs (lattices) are commonly used as an approximate representation of the complete word search space. Usually these word lattices are acyclic and have no a-priori structure. More recently a new class of normalized word lattices have been proposed. These word lattices (a.k.a. sausages) are very efficient (space) and they provide a normalization (chunking) ...
متن کاملHigh-level approaches to confidence estimation in speech recognition
We describe some high-level approaches to estimating confidence scores for the words output by a speech recognizer. By “high-level” we mean that the proposed measures do not rely on decoder specific “side information” and so should find more general applicability than measures that have been developed for specific recognizers. Our main approach is to attempt to decouple the language modeling an...
متن کاملUsing Sub-word-level Information for Confidence Estimation with Conditional Random Field Models
The task of word-level confidence estimation (CE) for automatic speech recognition (ASR) systems stands to benefit from the combination of suitably defined input features from multiple information sources. However, the information sources of interest may not necessarily operate at the same level of granularity as the underlying ASR system. The research described here builds on previous work on ...
متن کامل